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Abstract 

The problem of how to place students in a sequence of hierarchically related 
courses is addressed from an (empirical) Bayesian point of view. Based on a 
minimal set of assumptions, it is shown that optimal mastery rules for the courses 
are always monotone and a nonincreasing function of the scores on the placement 
test. On the other hand, placement rules are not generally monotone but have a 
form depending on the specific shape of the probability distributions anc utility 
functions in force. The results are further explored for a class of lines; utility 
functions. 
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Some Decision Theory for Course Placement 

A course placement problem is met when students with different aptitudes enter an 
educational program offering classes in the same domain at several entrance 
levels. The typical solution of the problem is to administer an aptitude test to the 
students to decide at which level they should start. It is the purpose of this paper 
to demonstrate how (Bayesian) decision theory could be used to optimize such 
placement decisions. 

For simplicity, the case of two courses is considered where the first 
course has a lower entrance level than the second course. It is assumed that an 
aptitude test is administered to decide whether the students have to take the first 
course or can go straight to the second course. In addition, it is assumed that the 
first course ends with a mastery test to decide whether those students taking the 
first course have learned its content and can be admitted to the second course. A 
flowchart of the problem displaying the temporal relationship between the tests and 
courses if given in Figure 1 . 



(Figure 1 about here] 

To derive optimal decision rules for the placement and mastery decisions, 
(Bayesian) decision theory is a natural framework. Examples of a decision- 
theoretic treatment of the mastery problem are found in Hambleton and Novirk 
(1973), Huynh (1976, 1977, 1982). Huynh and Perney (1979), van der Linden 
(1980, 1990). and van der Linden and Mellenbergh (1977). The placement 
problem has been addressed in Sawyer (submitted) and van der Linden (1981). 
The novelty in this paper is that optimal rules for the two decisions problems are 
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derived simultaneously. The advantages of a simultaneous approach have been 
spelled out in Vos (1994,1995). For the present problem, they will be discussed 
later in this paper. 

Notation 

Sampling of students from a population P is assumed. The observed 
scores on the aptitude or placement test, administered before the first course, will 
be denoted by a random variable X. Likewise, the observed score on the mastery 
test is denoted by Y whereas the true score on this test is denoted by T. For ease 
of exposition, the variables are assumed to be continuous. 

For each possible value of the observed score, the decision to assign the 
students either to Course 1 or to Course 2 has to be made. A placement rule can 
therefore be denoted either by the set of x values A.={x:Course 1} or its 
complement A^xiCourse 2). Likewise, a mastery rule can be represented by the 
set B 1 (x)={y:x,Fail) or its complement E^fxHyxPass}. Note that the mastery rule 
is allowed to depend on the score on the placement test, X=x. This type of 
mastery rule presents the most general approach. For example, it is possible that 
for candidates in Course 1 with a high score on the placement test a different 
mastery rule is optimal than for candidates with low scores. The idea to allow 
mastery rules to depend on the scores on a previous placement test is congenial 
with the idea developed in Bayesian statistics that the use of collateral information 
generally improves the quality of decision making. On the other hand, as will be 
shown below, constraining optimal rules B^(x) or B 2 (x) to be the same sets of y 
values tor each possible value x requires additional conditions on the test score 
distributions 
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Distributional Assumptions 

It is assumed that if all students in P are assigned to Course 1, the trivariate 
distribution of their scores (X,Y,T) can be represented by a density function 
f 1 (x.y.t). If, on the other hand, all students are assigned directly to Course 2, the 
distribution is assumed to be represented by a density function f 2 (x,y,t). For future 
reference, it is understood that the conditional distributions of T given (X=x,Y=y), T 
given X=x, Y given X=x, and (Y,T) given X=x are denoted by Pj(tlx,y), q|(tlx), 
hj(ylx), and rTy.t | x), i=1 ,2, respectively. The marginal distribution of X is denoted 
by gj(x). Without loss of generality, it is assumed that all density functions are 
nonnegative everywhere. As usual, cumulative distribution functions will be 
denoted by capitals. 

The following assumptions about f ( (x.y,t) seem plausible: 



g,(x)=g(x). i=1.2; (1) 

P,(tlx,y) is decreasing in x and y for all values of t, i=l,2; (2) 

H^ylx) is decreasing m x for all values of y, i=1,2: (3) 

Qjlllx) is decreasing m x for all values of 1. 1=1,2. " (4) 



The first assumption is realized if the placement test is administered before either 
course is taught--a condition automatically met in regular course placement 
problems The last three assumptions require the conditional distributions of the 
observed and true mastery test scores to be stochastic increasing in their 
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conditioning variables. These conditions can be expected to be met if the 
placement and mastery tests are well designed, that is, if they are constructed 
such that high scores on the placement test tend to coincide with high observed 
and true scores on the mastery test. In van der Linden and Vos (in press), it is 
proved that conditions as in (4) follow from the assumptions in (2)-(3). 

Several other assumptions could be specified to define a course 
placement problem with well-designed tests and courses. For example, for the 
conditional cumulative distributions functions of (Y,T) given X=x, R|(y,tlx), it seems 
reasonable to expect that: 

Rj(y,tlx) increases in i for all values of x, y, and t. (5) 

This condition states that the probability of high observed and true scores on the 
mastery test given a score on the placement test is larger for students assigned to 
Course 1 than tor those assigned directly to Course 2. In other words, no matter 
the aptitude scores of the students, following Course 1 will tend to have a positive 
effect on their observed and true mastery of its subject matter. However, this and 
other obvious properties are not needed to derive the results presented in this 
paper. For a more general treatment of the properties of stochastic order in 
multivariate distributions of test scores, see van der Linden (submitted). 



Utility Structure 

The usual approach is to define the utilities involved in mastery decisions as a 
function ot the true score underlying the mastery test. Since mastery decisions are 
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made only tor students assigned to Course 1, it seems logical to also view the 
utilities involved in the outcomes of this decision as a function of the true mastery 
score. As for Course 2, the position is taken that the decision to assign students 
directly to this course can be viewed as an advanced mastery decision based on 
the aptitude test as a predictor of the students’ mastery scores after Course 1. It 
follows that the true mastery score can be used as the common criterion to 
measure the utilities involved in all decisions in the simultaneous placement- 
mastery problem addressed here. More in particular, the following utility functions 
are defined: 



Assignment to Course 1 (x e A 1 ) and Fail (y e B 1 (x)): 


u 1 (t): 


Assignment to Course 1 (x e A 1 ) and Pass (yeB 2 (x)): 


u 2 (t); (6) 


Assignment to Course 2 (x e A 2 ): 


u*(t). 


As for the shape of the utility functions, it is assumed that: 




u.ltl is monotomcally decreasing in t: 


(7) 


u 2 (!) and u *(!) are monotomcally increasing in t. 


(8) 


Condition (7) can be defended pointing at the fact that the decision to have a 
student fa>i after Course 1 has less utility, the higher the true mastery level of the 
student The reverse is true for the other two possible decisions; both for the 
actual mastery and the ‘advanced mastery' decision, it should hold that the 
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outcome has larger utility, the higher the true mastery level of the student. 
However, the actual mastery decision involves the additional costs of teaching 
Course 1 which are missed for the 'advanced mastery' decision. For traditional 
group-based instruction, these costs can be assumed to be a constant 
independent of t Hence, 

u *(t)>u 2 (t) for all values of t, (9) 

As yet, no parametric form will be assumed for the utility functions. The 
intention is first to further explore the features of the problem as well as the shape 
of its optimal decision rules. Then a parametric form for the utility functions will be 
introduced. In doing so, two additional assumptions about the nature of the 
difference between U 2 O) and u *(t) will be made. 



Formalization of the Problem 

Tne condition on the utility functions in (9) suggests that if utilities were the only 
concern, the best decision would be to always assign the students to Course 2. 
This action represents the largest utility since the constant costs involved in 
teaching the first course are missed However, if the decisions had to be based 
only on the test score distribut ons, the best decision would always be to assign 
students to Course 1. As the condition m (5) shows, no matter the score of the 
students on the placement test, the probability of having high observed or true 
mastery of the subject matter taught in this course increases when attending it. 
Therefore, if mastery of Course 1 is a prerequisite for success in Course 2, 



BEST COPY AVAILABLE 



Course Placement 
8 



students who attend Course 1 will tend to be more successful in Course 2. 

Obviously, a criterion is needed to solve this dilemma between utility and 
probability. In this paper, the Bayes criterion of maximum expected utility is used. 
For the utility functions and distributions defined above, the expected utility 
associated with the simultaneous problem considered in this paper can be written 
as: 



E[U(Ai .B-j (x))] = f{ J j"u 1 (t)f 1 (x,y,t)dtdy + [ [* U 2 (t)f i (x.y.t) dt dy}dx 

A^xfR 62 (x)R 

+ f f [u*(t)f 2 (x,y,t)dtdydx. (10) 

A 2 R R 

It is assumed that the integrals in (10) converge. Formally, the problem is now to 
find sets A 1 and B^x) for which the expression in (10) is maximal. 

Moving to posterior densities and taking expectations, (10) can be written 
as: 



E[U(A 1 .B 2 (x))] = \ {E [u i (T) ix] - E 2 [u *(T) |x] 

Al 

+ | E 1 (u 2 (T) - u^T) |x.y]h 1 (y )x)dy}g(x)dx 

B 2 (x) 

+ E 2 [u *(T)], (11) 

where the subscripts at the expectation signs index the distributions with respect to 
which the expectation is taken. The problem is now to find sets A 1 and E^x) for 
which (11) is maximal. Note that the last term of (11) is a constant independent of 
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A 1 and E^fx) and may be ignored. Since the sample spaces of X and Y are not 
constrained in any way, the optimal sets can be found by first maximizing the 
second term over all possible sets of y values and next maximizing the sum of the 
first two terms over all possible sets of x values. 



General Solution to the Problem 

The general results in this paper are presented in the form of two propositions. 
The first proposition states that under the conditions given above the optimal 
mastery rule is always a cutting score on the mastery test score Y which is a 
decreasing function of the placement test score X=x. The second proposition 
states that the form of the optimal placement rule does not necessarily have the 
form of a cutting score on the placemen! test. 

Proposition 1 . Under conditions (1)-(3) on the posterior distributions of T given 
(X=x,Y=y) and Y given X=x, it holds for the utility functions in (7)-(8) that the 
optimal sets E^fx) are intervals (y c ix). ®), where y c (x) is a decreasing function of 
x. 

Proof Prom (7j-(8i it follows that uplti-u^t) is increasing in t whereas the condition 
in (2) states that tne distribution of T given (X=x,Y=y) is stochastic increasing in 
the two conditioning variables From a well-known theorem for such distributions 
(Lehmann, 1986. p i 16), it follows tnat t^UofO-u^T) | x,y] increases in x and y. 
Let x(x.y) denoti this expectation, and consider the relation {(x,y)lx(x,y)=0). This 
relation defines a function y=y c (x) which is decreasing in y (van der Linden & Vos, 
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in press, Lemma 4). Hence, for each x there exists a value y (x) such that x (x,y) 

V 

is nonnegative for y>y c (x) but negative for y<y c (x). Since the integral of x (x,y) 
over h 1 (ylx) is maximal if it is taken over all values of x for which the integrand is 
nonnegative, it follows that the optimal sets B 2 (x) are the intervals (y c (x),«]. ■ 

Proposition 2 . Linder the conditions stated above it does not hold generally that the 
optimal set A 1 is monotone, that is, there always exists a cutting score x Q such 
that A.|=(-°-=,x c ] is optimal. 

Proof . From (4) and (7) it follows that E 1 (u^ (T)lx) decreases in x whereas (4) and 
(8) imply that E 2 (u *(T)lx] increases in x. Therefore, the integrand in the first term 
in (11) decreases in x. Following an argument in van der Linden and Vos (in 
press), it is now proved that the integrand in the second term in (11) is 
nondecreasing in x. Substituting B 2 (x)= (y c (x),«) into (11), since x(x,y) is 
nonnegative for y>y c (x) and y c (x) is decreasing in x, it holds for any x 2 >x 1 that 

| Ttx 2 .y)h 1 (y ;x 2 )dy - | T(x 1 .y)h 1 (y jx^dy > 

y c i x 2> y c i x i' 

f 'C(x2.y)h 1 (y|x 2 )dy - | X(x 1 ,y)h 1 (y |X 1 )dy > 

y c {x i> y c > x i) 

| t(x 1 .y)(h( 1 yx 2 ) - h^y.x^jdy = 

y c < x i> 

| <p(y)l h i(y! x 2 ) - h iiy |X i)) d y- ( 12 ) 

— DO 
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where <p(y) s 1 [y c (x-| ),oo)(y) T ( x i .y). and l[y c ( Xl ),oo)(y) is an indicator function 
which takes the value 1 if y e [y c (x 1 ),») and the value 0 otherwise. By 
definition, tp(y) is a nondecreasing function of y, and from (3) it follows that the 
second term in (11) is nondecreasing indeed. Now it depends on the rate of 
change of the first and second terms in (11), and hence on the specific form of the 
utility functions and score distributions whether the integrand in the integral over x 
in (11) is a monotonically decreasing function of x. Therefore, the optimal set A 1 
does not generally take a monotone form. ■ 

Proposition 2 implies that for some utility functions and distributions, an 
optimal set A 1 may consists of two or more non-adjacent intervals of placement 
test scores. Generally, it does not hold that if a student with a certain aptitude is 
assigned to Course 1, the same should apply for all students with lower aptitudes. 
The result goes against the prevailing practice of using placement rules in the form 
of a single cutting score on a test. The explanation of this 'anomaly' is the 
dilemma between utility and probability referred to in the previous section. Utilities 
and probabilities are different quantities. If for a given aptitude level it is known that 
the expected increase in mastery due to attending Course 1 offsets the increase in 
utility involved in directly attending Course 2, no inferences as to other aptitude 
levels are possible without knowing the specific forms of the utility functions and 
probability distributions. 

Calculation of Optimal Rules 

From the proof of Proposition 1 it follows that the optimal mastery function 
is the decreasing function y=y (x) defined by 
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E 1 [u 2 (T)-u 1 (T)|x.y] = 0. (13) 

Substituting y c (x) into (11), maximal expected utility is obtained if A 1 is taken to be 
the set of x values for which the integrand of the outer integral is nonnegative. 
That is, the optimal set A 1 is given by: 

A 1 = (x: Ei(ui(T)|x] - E 2 [u*(T)|x] 

+ J E 1 [u 2 (T)-u 1 (T)|x,y]h 1 (y|x)dy > 0). 

YcW (14) 

The optimal set in (14) can be found using a suitable method of numerical 
integration. As in practice observed test scores are always discrete, a simple 
approach is to substitute all possible test scores into the expression in (14) to find 
the subset for which it is nonnegative. However, to be able to do so, the 
regression functions in (14) must be estimated. The following section shows how 
to deal with the problem if linear utilities and regression functions can be assumed 
to hold 

Linear Utility Structure 

For the utility functions in (6). the following linear structure is adopted: 



Ui(t) = b-j (t c -1) ♦ . b^O. 


(15) 


u 2 (t) = b 2 (t-t c W a 2 . b 2 > 0 . 


(16) 
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u*(t) = b*(t-t c )+ a*, b*>0, (17) 

where t c is assumed to be a threshold value on the true mastery scale which 
separates true masters from true nonmasters. If such a threshold does not exist, 
the above functions should be reparameterized absorbing t into the intercept 
parameter. Note that the slope parameters are required to be larger than zero to 
meet the conditions in (7)-(8). Also, generally, a^, a 2 > a <0 because these utility 
constants represent the constant parts of the costs involved in teaching Course 1 
and/or Course 2. It is reminded that this paper views the decision to assign 
students to Course 2 as an advanced mastery decision. Hence, utility functions 
(16) and (17) have the same parameter structure but may have different values for 
their parameters to deal with the specifics of the application. An example of the 
use of these utility functions will be given later in this paper. Techniques to elicit 
utility functions for placement decisions have been addressed in Sawyer (1994, 
April). 

Substituting utility functions (1 5)-(l 7) into the expression for the expected 
utility in (11) and using the result in Proposition 1: 

E[U(A 1t y c (x))) = f((b 1+ b*)t c + (a 1 -a , )-b 1 E 1 (T|x)-b*E 2 (T|x) 

Ai 

c>o I 

* | [-(b 1 *b 2 )t c ♦ 02 -a^ ♦ (b 1+ b 2 )E 1 (T|x.y)h 1 (y|x)dy)g(x)dx. 

y c < x > (i8) 

From the general solution in (13) it follows that the optimal mastery function is 
defined by. 
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-(b 1 +b 2 )t c + (a 2 -ai) + (b^ +b 2 )Ei(T|x,y) = 0, (19) 

whereas (14) implies that the following set of x values is optimal: 

A 1 = (x: (b 1+ b*)t c + (a 1 -a*)-b 1 E 1 (T(x)-b*E 2 (T|x) 

oo + 

J ( -(b-j +b 2 )t c + (a 2 -a-|) + (b-| +b 2 )E 1 (T|x l y)h 1 (y|x)cJy > 0). (20) 

y c w 

Linear Regression 

If linear regression of T on X and Y can be assumed, that is, if it holds 
that 

El(T|x,y) = ajxYi + PtxYI* + YTXYiy- ( 21 ) 

the optimal mastery function is: 

y c(x) = (>C - ( a 2 -a 1 ^b-| +b 2 ) -cijxyiIYjxyi “ (PtXYIYjXYI^' 

Note that this function is linear in x with a slope parameter equal to -PjxYYtxY' 
Since families of conditional distributions with the property in (2) have nonnegative 
covariances between the random variable and the conditioning variables (van der 
Linden, submitted), it follows that Pjxy anc( Yjxy are nonne 9 a *' ve ' ar| 9 thus that 
the function in (22) is nondecreasing indeed. 

As classical test theory shows: 
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Ej(Tlx) = Ej(Ylx). 

The former can thus be written as: 

Ej(Tlx) = a Y xi + PyXi x - ( 23 ) 

Substituting (21) and (23) into (20), the optimal set A 1 is: 

A 1 = (x: K-j -(b^Yxl + b*PyX2) x 

+ [ [K 2 + (bi + b2)(PjYxi x + ttyx 1 y ) h 1 (y ! ( 24 ) 

Yc( x ) 

with 

Ki = bi (t c - Ctyxi ) + b * (t c - 0tyx2^ + ( a 1 - a *) ■ (25) 

K 2 = (b^ + b 2 )(ayxYi - t c ) + (82 •• a-j ) . (26) 

and where y c (x) is given by ( 22 ). 

Classical test theory shows that the regression parameters in (22) and 

(24)-(26) are simple functions of the mean of Y, the variances of X and Y, the 

correlation between X and Y, and the reliability of Y. If X and Y are standardized, 

the relevant expressions are ayxi = 0, Pyxi = PxYi- a TXYi = 0. PTXYi = 
2 2 2 
PXYi^ “PYYiW " P XYi^ ’ and YTXYi ” 'PYYrPxYi ^ 1 _p XYi^ ( Lor ^ ^ ^ovick. 

1968, chaps. 2 and 12). 

Substituting the above results into (22), the following form for the optimal 
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mastery function is obtained: 



y c (x) - (t c -(a 2 -a 1 )/(b 1 +b 2 ))( — j— ) 

PYY1 -Pxyi 



.PXYlO-PYYl), 

( __)x. 

PYYI'Pxyi ( 27 ) 



The same substitution into (24) gives: 

Ai = {x; «i - K 3 x + C (K 2 + K 4 x + K 5 y)h ^ (y |x)dy > 0), (28) 

with 



K 1 = (b^ + b*)t c + a^ - a*, 


(29) 


K 2 = -(b-| + b 2 )tp + a 2 - a 1 , 


(30) 


k 3 = b 1 PXYI + P *PXY 2 ■ 


( 31 ) 


2 

K 4 = (b 1 + b 2 )p X Yi (1 - PYYlVl 1 “ PxY1>' 


(32) 


2 2 

K5 = (b^ + b 2 )(PYYi - Pxyi^^ - Pxyi ^ ' 


(33) 



To use the above decision rules the conditional density functions h^y | x) 
have to be estimated from the data. Fitting a well-chosen parametric form for these 
densities is an obvious approach to the estimation problem. If h^y | x) is chosen 
to be the normal density function, an appropriate change of variable in the integral 
and use ol the fact that for the c.d.f.. <I>(u), it holds that |0(u)-[1 +exp(1.7u)] _1 1 



fj 



(f 
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< 0.01 uniformly in u (Lord & Novick, 1968, sect. 17.2) show that (28) is equal to: 



Obviously, the second and the fourth term in left-hand side of the 
inequality in (34) are monotonically decreasing in x. Since the slope of y (x) in (27) 
can never be positive, the second factor of the third term is also monotonically 
decreasing in x. However, the first factor of this term is increasing in x. Thus it 
depends on the relative acceleration of these two factors whether or not the left- 
hand side of the inequality is monotonically decreasing in x and allows for an 
optimal placement rule of the form A 1 = (-», x c ]. 



A few numerical examples were calculated to shed some light on the behavior of 
the optimal placement rule and mastery function in (34) and (27) for several values 
of their parameters. The following six cases were addressed: 



A 1 = (x: «i - «3 x + («2 + I 



+ K 7 exp (-. 




1 _p XY 1 



with 



K 6 = K 4 + PXY 1 K 5 - 



(35) 



K ? = (27t)~ 1/2 (1-px Y1 )K 5 . 



( 36 ) 



Numerical Examples 
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Case 1: Standard parameter values . Correlations between cognitive 
predictor variables and college grades typically range from 0.50-0.60 (Etaugh, 
Etaugh & Hurd, 1972; see also Schoenfeldt & Brush, 1975). A choice of 0.55 for 
PXYi seems therefore realistic. The same authors found an average reliability for 
the grade-point average over a large variety of programs equal to 0.689. Since the 
average number of courses per program was 5.23, the reliability of a single grade 
is generally lower. On the other hand, tests for remedial courses typically are more 
carefully constructed than tests for regular courses. Based on these 
considerations, a choice of pyyt =0.60 was made. The cutting score t_ was set 

V/ 

equal to 0.5, which is a value 0.5/ ^.60 standard deviations larger than the 
average score on the mastery test. The values of the utility parameters a^, a 2 > and 
a * in (15)-(17) had to be selected relatively to the scale of t. It was assumed that 
the costs involved in additionally teaching Course 1 were equal for the students 
who failed and those who passed the mastery test, that is, 3^2. On the other 
hand, sending students directly to Course 2 does involve only the costs of teaching 
Course 2. Therefore, utility parameter a * was set larger than the common value 
of a 1 and a 2 - As for the values of the elope parameters b 1 and b 2 in (15)-(16), 
failing the mastery test for high values of t was assumed to be less serious than 
passing the test for low values. This choice is justified if the program is structured 
such that a retest is offered to those students who fail the test but those who pass 
are allowed to continue without a further check of their true mastery levels. Hence, 
b 1 <b 2 As the placement decision is considered as an advanced mastery decision 
in this paper, the value of its slope parameter was set equal to value for the actual 
mastery decision, that is, b 2 =b * . In summary, the following set of standard values 
for the parameters m (27) and (34) was assumed: 
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PXV1 = PXV2 =055: 

PYyi =0.60; 
t c =0.50; 
a 1 =a 2 =*l.O; 
a*=-0.50; 
b 1 =1 .0; 
b 2 =b *=1.2. 

Case 2: Higher costs of teaching Course 1 , It was assumed that the 
constant costs involved in the teaching of Course 1 were higher than the standard 
value in the previous case. Therefore, the common value of the parameters a 1 and 
a 2 was lowered to -1.50. All other parameters were kept at their standard values. 

Case 3: Utility less sensitive to true mastery level . To simulate the case of 
the utilities associated with the various decision outcomes being less sensitive to 
differences between the true mastery scores, the values of the three slope 
parameters were halved: b^ =0.5 and b 2 =b*=0.6. All other parameters were kept 
at their standard values. 

Case 4: Adaptive instruction . If Course 1 is an individualized course in 
which the intensity of instruction is adapted to the aptitude level of the students, 
the savings in instructional costs involved in assigning students directly to Courses 
2 may decrease as a function of t. To simulate this case, the values of b^ and b 2 
were raised to 1.2 and 1.4, respectively, but all other parameters were kept at their 
standard values still. 

Case 5: Higher predictive validity of placement test . Next, it was 
supposed that a better placement test was available than the test studied in Case 
1. Therefore, pxyi was raised to 0.70. The height of this value must be judged 
relatively to the reliability of the mastery' scores which was kept at its standard 
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value (as were the values of all other parameters). 

Case 6: Higher reliability of mastery test . To assess the effect of a higher 
reliability of the mastery scores on the optimal placement and mastery rules, 
PYyi was raised to O.f.5, leaving all other parameters at their standard values. 

Figure 2 gives the plot of the optimal mastery function y (x) as a function 
of 



[Figure 2 about here] 

x. It appears that the function decreases by an amount of 0.74 for an increase of 
the placement score X equal to one standard deviation. In the second part of 
Figure 2, X. (x) is the left hand-side of the inequality in (34) plotted as a function of 
x. To find the roots of 



[Table 1 about here] 

Mx)=0, Newton's method, as implemented in Mathematica l FindRoot command 
(Wolfram, 1993) was used. Table 1 lists the results for the present and the 
following cases. Only one root was found, at x=0.592. Thus, a monotone 
placement rule exists for this case. 

The plots for all other cases are given in Figure 3. For higher costs of 
teaching Course 1 (Case 2), the root of X(x) moved down to x=0.094, which, as 
expected, implies 
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h 61 S “ en,s a,e ass « nHi 10 Course 1. For utnife less sens#ive t0 
changes rn the tree mastery level , (Case 3), X(x) was again found to cross the 
onzontal axrs only once, this time at xr0.092. Making the instruction in Course I 
adaptive to the mastery levels ol the students (Case 4) resulted in the existence ol 
two cutting score on the placement, one at x=0.654 and another at x=5.455 Given 
the d, sir, but, on ol X assumed in this example, the second cutting score can be 
ignored lor practical reasons A higher validity of the placement test lo, the 
examinees assrgned to Course 1 (Case 5) resulted in an optimal masteiy (unction 
running much sleeper because the prediction ol the mastery based on the scores 
could now be based on more valid placement scores. The placement rule was 
nonmonotone again; two cutting scores were lound, one at x=0.442 and the other 
al x=2.778 implying that examinees with lower and very high placement scores 
had to be assigned to Course 1 and the others to Course 2. Finally, lo, a higher 
reliability ol the mastery lest lo, the students assigned lo Course 1 (Case 6), the 
relative impedance ol the placement test as a predictor ol the true masteiy level 
wen; down substantially, and the optima, masteiy function became a more 

or, rental lunclron ol the placement score X(x) now crossed the horizontal axis 
only at x=0.746. 



Concluding Remarks 



Cne ol the advantages of optimizing placement and mastery decisions 
simultaneously ,s mat more realistic utility structures are possible. For example the 
point ol view taken ,n this paper that placement decisions are in tact advanced 
mastery dec, sons could never have been translated into a utility structure ,1 the 
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two decisions were treated separately. Another advantage is that the placement 
scores can be used as collateral information to improve the mastery decisions. The 
example showed that for a linear utility structure the behavior of the optimal 
placement rule and mastery function was realistic for various changes in the 
sensitivity of utility to true mastery level, the adaptivity of the instruction as well as 
the validity and reliability of the placement and mastery tests, respectively. 

The question can be raised under what conditions the optimal cutting 
score on the mastery test is a constant independent of the score on the placement 
test rather than the decreasing function defined by (13). From (11) it immediately 
follows that if 



h^(ylx) = h^(y), for all x, (37) 

the inner integral is only a function of y and the mastery rule becomes independent 
of X=x. If (37) is satisfied, it holds that {3jxy -0- anc * ,he °P ,ima| mastery function 
under linear regression in (27) reduces to its first term which is a constant indeed. 
However. (37) represents the case in which the aptitude test nas no predictive 
validity whatsoever as to the scores on the mastery test. I! is doubted whether 
such tests could ever play a significant role in placement decisions. In a sense, it 
has thus been demonstrated in this paper that placement decisions can only be 
based on valid aptitude tests if they involve optimal mastery rules which are a 
decreasing function of the aptitude scores. 
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Table 1 

Roots ol Mx) lor Cases 1 through 6 



Case 



Roots of X(x) 



1 

2 

3 

4 

5 



0.592 

0.094 

0.092 

0.654 5.455 

0.442 2.778 



6 



0.746 
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Figure 1 . 



Figure 2 . 



Figure 3 . 



Figure Captions 

Flowchart of a two-course placement problem with 
a mastery test. 

Optimal placement rule and mastery function for 
standard set of parameter values (Case 1). 

Optimal placement rules and mastery functions for 
Cases 2 through 6. 
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